Track & Field
- North America > United States > Maine (0.04)
- North America > Canada > Newfoundland and Labrador > Newfoundland (0.04)
- Europe > United Kingdom > England > Staffordshire (0.04)
- Asia > East Asia (0.04)
- Transportation > Passenger (1.00)
- Transportation > Infrastructure & Services (1.00)
- Transportation > Air (1.00)
- (6 more...)
- Europe > Switzerland > Zürich > Zürich (0.14)
- Asia > China > Shanghai > Shanghai (0.04)
- South America > Peru > Lima Department > Lima Province > Lima (0.04)
- (4 more...)
- Research Report > New Finding (1.00)
- Research Report > Experimental Study (1.00)
- Information Technology (1.00)
- Leisure & Entertainment > Sports > Track & Field (0.92)
- Health & Medicine > Consumer Health (0.67)
- Law (0.94)
- Leisure & Entertainment > Sports > Track & Field (0.93)
- Health & Medicine > Consumer Health (0.68)
An Approach to Variable Clustering: K-means in Transposed Data and its Relationship with Principal Component Analysis
Saquicela, Victor, Palacio-Baus, Kenneth, Chifla, Mario
Abstract--Principal Component Analysis (PCA) and K-means constitute fundamental techniques in multivariate analysis. Although they are frequently applied independently or sequentially to cluster observations, the relationship between them, especially when K-means is used to cluster variables rather than observations, has been scarcely explored. This study seeks to address this gap by proposing an innovative method that analyzes the relationship between clusters of variables obtained by applying K-means on transposed data and the principal components of PCA. Our approach involves applying PCA to the original data and K-means to the transposed data set, where the original variables are converted into observations. The contribution of each variable cluster to each principal component is then quantified using measures based on variable loadings. This process provides a tool to explore and understand the clustering of variables and how such clusters contribute to the principal dimensions of variation identified by PCA. We analyze multiple data sets with varying variability structures (USArrests, Iris, Decathlon2) to show that the correspondence between clusters of variables and principal components depends on the data's inherent structure.
- North America > United States (0.14)
- South America > Ecuador > Azuay Province > Cuenca (0.04)
- Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.48)
- Health & Medicine (0.47)
- Leisure & Entertainment > Sports > Track & Field (0.46)
- Law (0.94)
- Leisure & Entertainment > Sports > Track & Field (0.93)
- Health & Medicine > Consumer Health (0.68)
- Europe > Switzerland > Zürich > Zürich (0.14)
- Asia > China > Shanghai > Shanghai (0.04)
- South America > Peru > Lima Department > Lima Province > Lima (0.04)
- (4 more...)
- Research Report > New Finding (1.00)
- Research Report > Experimental Study (1.00)
- Information Technology (1.00)
- Leisure & Entertainment > Sports > Track & Field (0.92)
- Health & Medicine > Consumer Health (0.67)
- North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
- North America > United States > California > Santa Clara County > Palo Alto (0.04)
- North America > Canada > Quebec > Montreal (0.04)
- Europe (0.04)
- Leisure & Entertainment > Sports > Track & Field (1.00)
- Health & Medicine > Consumer Health (1.00)
- North America > United States > Texas > Loving County (0.04)
- North America > United States > Maine (0.04)
- North America > United States > Colorado (0.04)
- (3 more...)
- Transportation > Passenger (1.00)
- Transportation > Infrastructure & Services (1.00)
- Transportation > Air (1.00)
- (6 more...)
- North America > United States > Indiana (0.05)
- South America (0.04)
- North America > United States > New York > Suffolk County > Stony Brook (0.04)
- (4 more...)
- Health & Medicine > Consumer Health (1.00)
- Leisure & Entertainment > Sports > Track & Field (0.94)
- Consumer Products & Services (0.94)
Feature Impact Analysis on Top Long-Jump Performances with Quantile Random Forest and Explainable AI Techniques
Gan, Qi, Clémençon, Stephan, El-Yacoubi, Mounîm A., Nguyen, Sao Mai, Fenaux, Eric, Jelassi, Ons
Biomechanical features have become important indicators for evaluating athletes' techniques. Traditionally, experts propose significant features and evaluate them using physics equations. However, the complexity of the human body and its movements makes it challenging to explicitly analyze the relationships between some features and athletes' final performance. With advancements in modern machine learning and statistics, data analytics methods have gained increasing importance in sports analytics. In this study, we leverage machine learning models to analyze expert-proposed biomechanical features from the finals of long jump competitions in the World Championships. The objectives of the analysis include identifying the most important features contributing to top-performing jumps and exploring the combined effects of these key features. Using quantile regression, we model the relationship between the biomechanical feature set and the target variable (effective distance), with a particular focus on elite-level jumps. To interpret the model, we apply SHapley Additive exPlanations (SHAP) alongside Partial Dependence Plots (PDPs) and Individual Conditional Expectation (ICE) plots. The findings reveal that, beyond the well-documented velocity-related features, specific technical aspects also play a pivotal role. For male athletes, the angle of the knee of the supporting leg before take-off is identified as a key factor for achieving top 10% performance in our dataset, with angles greater than 169°contributing significantly to jump performance. In contrast, for female athletes, the landing pose and approach step technique emerge as the most critical features influencing top 10% performances, alongside velocity. This study establishes a framework for analyzing the impact of various features on athletic performance, with a particular emphasis on top-performing events.
- South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
- Europe > France > Île-de-France > Paris > Paris (0.04)